33. A method for obtaining one or more candidate nucleotide sequences, the 
candidate nucleotide sequence being indicative of a sequence of a target 
polynucleotide Violecule T, T producing a hybridization signal l(x) upon 
incubating T with a polynucleotide x for each polynucleotide x in a set E of 
polynucleotides, the method comprising the steps of: 

(a) for each polynucleotide x in the set E of polynucleotides, obtaining a 
probability W*) of the hybridization signal when the sequence x 
is not complementary to a subsequence of T and a probability Pi(x) of 
the hybridization signal when the sequence x is complementary to a 
subsequence ofV; so as to obtain a probabilistic spectrum (PS) of T; 

(b) assigning a scor\ to each of a plurality of candidate nucleotide 
sequences, the score being obtained in a calculation using the 
probabilistic spectrum and at least one reference nucleotide sequence H; 
and \ 

(c) selecting one or more\candidate nucleotide sequences having an 
essentially maximal score. \ 

34. The method according to Claim 33, wherein the polynucleotides x in the 
set E are immobilized on surface. 

35. The method according to Claim 33, wherein the set E is a set of k-mers. 

36. The method according to Claim 35 wherein E is the set of all k-mers formed 
from nucleotides from a predetermined set of nucleotides. 

37. The method of Claim 36 wherein the predetermined set of nucleotides is 
selected from the group consisting of 
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(a) adenine, guanine, cytosine, and thymine; and 

(b) adenine, guanine, cytosine, and uracil. 

38. The method according to Claim 33, wherein the score of a candidate 
nucleotide sequence f is obtained in a calculation using V (f ) where 

£*(r)=nw*). 

wherein T(x)= 0 if the sequence of x is not complementary to a subsequence of 
f and f (x) = 1 if the sequence of 3c is complementary to a subsequence of f . 

39. The method according to Claim 33, wherein the score of a candidate 
sequence f is obtained in a calculation using L e {f ) where logL c (f )= }►>(«,)» 

A ( \ 1 P l( e .) 

wherein f contains polynucleotides e 0 ,...e m and co\e x .)- log-^-^ . 

40. The method according to Claim 33, wherein T and H have a common 
length. 

41 . The method according to Claim 40, wherein the score of a candidate 
sequence f is obtained in a calculation using D u (f ) where 
D u (f)=Yl M0) h> h j]> wherein M ^ frj» h jl is a probability of a nucleotide tj in 
position j of T being replaced with nucleotide hj in position j of H. 



42. The method according to Claim 41, wherein the score of a candidate 
nucleotide sequence f is Score u i (f ), or Score u 2 (f) where 
5core",(f)=logL e (f)+logZ)"(f) and Score\{f)=\ogL e {ty\o % D u {f). 



43. The method according to Claim 42 wherein the polynucleotides in the set E 
are k-mers and the step of selecting a candidate nucleotide sequence having an 
essentially maximal score comprises the steps of 

(a) for each (k-l)-mer y calculating S u \y 9 k-l] = ^L U) [y hj\ 

y=i 

(b) for each integer j = k, . . .1, 

(ba) for each polynucleotide sequence (y \ , . . . yk_ i ), 

(baa) calculating 

ST \y, j] = [y M , A ,]+ max Is" [z, j - 1]+ a>(e) } 

wherein iXD [y, hj]= log MO) [y, hj]. 

(bab) selecting a (k-l)-mer P[ j; J] 

satisfying 

e={z,y)<zE 

(c) selecting a (k-l)-mer Zl having a score essentially equal to max 

(d) for j=k-l,...,l-l; recursively calculating (k-l)-mers Z) where Z?" 
^=P[Z/j]and 

(e) selecting candidate target sequence <zk-lj 5 zk-^-z^k-b z k k-b 
zk + lj <> i v ..zl] <> i> 5 where zj=<z)i zJ^.-zJ]^^ . 

44. The method according to Claim 9, wherein the polynucleotides in the set E 
are k-mers, and the step of selecting a candidate nucleotide sequence having an 
essentially maximal score comprises the steps of: 

(a) if the length / of the target is greater than a predetermined length, setting 

l + k-\ 
m= 2 ' 

(b) for each j = k- 1, m, computing 5 u [y,y] according to Claim 11 for 
all y ;. 

(c) for each j = 1,1-1, .... m, computing R" \y,j] by initializing for each y 



R"\y,i\= 0 and looping over 
for all 5? ; 

(d) selecting y m = argmax\s u \y,m]+ R u \y,m] }; and 

peV ) 

(e) computing the optimal sequence aligned to (h x ...h m ) ending with y m , and 
the optimal sequence aligned to (h m ...h t ) beginning with y m . 

45. , The method according to Claim 33, wherein H and T have lengths such that 
the length of T is less than the length of H. 

46/ The method according to Claim 45, wherein the step of assigning a score to 
each of a plurality of candidate nucleotide sequences and the step of selecting the 
candidate target sequence are performed according to Algorithm B. 

47. The method according to Claim 33, wherein H and T have arbitrary lengths. 

48. The method according to Claim 47, wherein the step of assigning a score to 
each of a plurality of candidate nucleotide sequences and the step of selecting the 
candidate target sequence are performed according to Algorithm C. 

49. The method according to Claim 47, wherein the step of assigning a score to 
each of a plurality of candidate nucleotide sequences and the step of selecting the 
candidate target sequence are performed according to Algorithm D. 

50. The method according to Claim 49 wherein a Hidden Markov Model is 
used instead of a reference sequence. 

51. The method according to Claim 33, wherein the algebraic equation (12a') 
replaces the algebraic equation (12a), the algebraic equation (12b') replaces the 



algebraic equation (12b), the algebraic equation (15*) replaces the algebraic 
equation (15), and the algebraic equation (16') replaces the algebraic equation (16). 

52. The method according to Claim 45, wherein the algebraic equation (20') 
replaces the algebraic equation (20), and the algebraic equation (21') replaces the 
algebraic equation (21). 

53. The method according Claim 47, wherein the algebraic equation (29') 
replaces the algebraic equation (29), and the algebraic equation (30') replaces the 
algebraic equation (30). 

fy\ 54. The method according to any one of the previous claims wherein the target 
^/^J^omprises two or more polynucleotide molecules. 



55. The method according to Claim 33 computing the exact score V (f ) for 
several candidate sequences chosen according to the value of the approximated 

r-(t). 



score 



56. The method according to Claim 33 further comprising a step of deleting 
candidate sequences having likelihood below a predetermined score. 

57, The method according to Claim 33, wherein the score of a candidate 
nucleotide sequence f is obtained in a calculation using i/(f ) where 

^)=n%-)(*)> 

Jcg E 

wherein T(x) = r if the sequence of x is complementary to exactly r subsequences 
off. 



58. The method according to Claim 33, wherein the set E of polynucleotide 
does not include all the polynucleotides of a specific length. 



59. The method according to Claim 33, wherein the set E of polynucleotide 
includes polynucleotides of different lengths. 

60. The method according to Claim 33, for use in a task selected from the 
group consisting of: 

(a) detecting or genotyping of Single Nucleotide Polymorphisms; 

(b) detecting or genotyping of genetic syndromes or disorders; 

(c) detecting or genotyping somatic mutations; and 

(d) sequencing a polynucleotide whose function is related to the function of 
the reference polynucleotide. 

61 . The method according to Claim 33 , wherein polynucleotides contain gaps, 
or universal bases. 

62. The method according to Claim 33, wherein polypeptides are sequenced 
instead of polynucleotides. 

63. A program storage device readable by machine, tangibly embodying a 
program of instructions executable by the machine to perform method steps for 
obtaining a candidate nucleotide sequence, the candidate nucleotide sequence 
being indicative of a sequence of a target polynucleotide molecule T, T producing 
a hybridization signal l(x )\upon incubating T with a polynucleotide x for each 
polynucleotide x in a set E of polynucleotides, the method comprising the steps of: 

(a) for each polynucleotide x in the set E of polynucleotides, obtaining 
a probability of I(x) when the sequence x is not 

complementary toVa subsequence of T and a probability P\(x) of 
I(x) when the sequence x is complementary to a subsequence of T; 
so as to obtain a probabilistic spectrum (PS) of T; 



(b) assigning a spore to each of a plurality of candidate nucleotide 
sequences, the score being obtained in a calculation using the 

probabilistic spectrum and upon at least one reference nucleotide 
sequence H; and 

(c) selecting a candidate nucleotide sequence having an essentially 
maximal score. \ 

64. A computer program product comprising a computer useable medium 
having computer readable prograiA code embodied therein for obtaining a 
candidate nucleotide sequence, the candidate nucleotide sequence being indicative 
of a sequence of a target polynucleotide molecule T, T producing a hybridization 
signal I(x ) upon incubating T with a polynucleotide x for each polynucleotide x 
in a set E of polynucleotides, the computer program product comprising: 

(a) for each polynucleotide A in the set E of polynucleotides, computer 
readable program code For causing the computer to obtain a 
probability Po(*) of I(Jc) tlie sequence x is not complementary to a 
subsequence of T and a probability Pj(x)ofI(x) when the sequence 
x is complementary to a subsequence of T; 

(b) computer readable program coke for causing the computer to assign 
a score to each of a plurality oncandidate nucleotide sequences, the 
score obtained in a calculation using the probabilistic spectrum and 
at least one reference nucleotide sequence H; and 

(c) computer readable program code foY causing the computer to select a 
candidate nucleotide sequence havina an essentially maximal score. 



